Text corpus
LARGE AND STRUCTURED SET OF TEXTS BEING THE BASIS FOR LINGUISTIC RESEARCH
Textome; Text corpora; Language corpus; Linguistic corpus; Text item; Text data; Textual data; Multilingual corpus; Corpus of text; Textual corpus
In linguistics, a corpus (plural corpora) or text corpus is a language resource consisting of a large and structured set of texts (nowadays usually stored and processed electronically). In corpus linguistics, corpora are used for statistical analysis and hypothesis testing, for checking occurrences, and for validating linguistic rules within a specific language territory.
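As a rough illustration of what "checking occurrences" can look like in practice, the following minimal sketch counts word occurrences and a relative frequency over a tiny corpus. The three sentences in `TOY_CORPUS`, the regex-based `tokenize` helper, and the function names are all assumptions invented for this example; real corpora are far larger and are usually processed with dedicated tokenizers and annotation tools.

```python
import re
from collections import Counter

# Hypothetical toy corpus: three short "texts" invented for this illustration.
TOY_CORPUS = [
    "The cat sat on the mat.",
    "The dog chased the cat.",
    "A cat is not a dog.",
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens.

    This is a deliberately naive tokenizer (an assumption of this sketch);
    it ignores digits, punctuation, and non-ASCII letters.
    """
    return re.findall(r"[a-z]+", text.lower())


def count_occurrences(corpus):
    """Return a Counter mapping each token to its number of occurrences."""
    counts = Counter()
    for text in corpus:
        counts.update(tokenize(text))
    return counts


def relative_frequency(counts, word):
    """Return the share of all tokens in the corpus that are `word`."""
    total = sum(counts.values())
    return counts[word] / total if total else 0.0


counts = count_occurrences(TOY_CORPUS)
print(counts.most_common(3))
print(relative_frequency(counts, "cat"))
```

Raw counts like these are the starting point for the statistical analysis mentioned above; comparing relative frequencies rather than raw counts makes corpora of different sizes comparable.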